Minimum Description Length Recurrent Neural Networks
نویسندگان
چکیده
Abstract We train neural networks to optimize a Minimum Description Length score, that is, balance between the complexity of network and its accuracy at task. show optimizing this objective function master tasks involving memory challenges go beyond context-free languages. These learners languages such as anbn, anbncn, anb2n, anbmcn +m, they perform addition. Moreover, often do so with 100% accuracy. The are small, their inner workings transparent. thus provide formal proofs perfect holds not only on given test set, but for any input sequence. To our knowledge, no other connectionist model has been shown capture underlying grammars these in full generality.
منابع مشابه
A Temporal Minimum Description Length Policy for Evolving Neural Networks
One of the most important issues for computational methods is their time complexity. This paper introduces a temporal MDL (minimum description length) policy for evolving neural networks based on their execution time on the hosting hardware. Temporal MDL implements an adaptive selection pressure based on the actual processing time of the evolving solutions and thus favors creation of faster, mo...
متن کاملInformation Geometry and Minimum Description Length Networks
We study parametric unsupervised mixture learning. We measure the loss of intrinsic information from the observations to complex mixture models, and then to simple mixture models. We present a geometric picture, where all these representations are regarded as free points in the space of probability distributions. Based on minimum description length, we derive a simple geometric principle to lea...
متن کاملMinimum Translation Modeling with Recurrent Neural Networks
We introduce recurrent neural networkbased Minimum Translation Unit (MTU) models which make predictions based on an unbounded history of previous bilingual contexts. Traditional back-off n-gram models suffer under the sparse nature of MTUs which makes estimation of highorder sequence models challenging. We tackle the sparsity problem by modeling MTUs both as bags-of-words and as a sequence of i...
متن کاملMinimum Description Length Criterion
he intelligibility of speech in communication systems is generally reduced by interfering noise. This interference, which can take the form of environmental noise, reverberation, competing speech, or electronic channel noise, reduces intelligibility by masking the signal of interest. The reduction in intelligibility is particularly troublesome for listeners with hearing impairments, who have gr...
متن کاملMinimum Description Length Principle
The minimum description length (MDL) principle states that one should prefer the model that yields the shortest description of the data when the complexity of the model itself is also accounted for. MDL provides a versatile approach to statistical modeling. It is applicable to model selection and regularization. Modern versions of MDL lead to robust methods that are well suited for choosing an ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Association for Computational Linguistics
سال: 2022
ISSN: ['2307-387X']
DOI: https://doi.org/10.1162/tacl_a_00489